A Scalable Two-Phase Top-Down Specialization Approach for Data Anonymization Using Map Reduce on Cloud

نویسنده

  • R. Thaayumaanavan Bharath
چکیده

: More number of users requires cloud services to transfer private data like electronic health records and financial transaction records. A cloud computing services offers several flavors of virtual machines to handle large scale datasets. But centralized approaches are difficult in handling of large datasets. Data anonymization is used for privacy preservation techniques. It is challenged to manage and process such large-scale data within a cloud application. A scalable two-phase top-down specialization (TDS) approach to anonymize large-scale data sets using the Map Reduce framework on cloud. It is used to investigate the scalability problem of large-scale data anonymization techniques. These approaches deliberately design a group of innovative Map Reduce jobs to concretely accomplish the specialization computation in a highly scalable way. The Top-Down Specialization process speeds up the specialization process because indexing structure avoids frequently scanning entire data sets and storing statistical results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Big Data Processing with Privacy Preserving Using Map Reduce on Cloud

A large number of cloud services require users to share private data like electronic health records for data analysis or mining, bringing privacy concerns. Anonymizing data sets via generalization to satisfy certain privacy requirements such as k-anonymity is a widely used category of privacy preserving techniques. At present, the scale of data in many cloud applications increases tremendously ...

متن کامل

An Effective Specialization Approach for Data Anonymization Using Map Reduce on Cloud

Data privacy preservation is one of the most dispersed concerns on the modern business. Data sequestration concern urgency to be consigned crucially before data sets are communal on a cloud. Data anonymization refers to as concealing complicated data for heirs of data records. In this paper interrogate the complications of big data anonymization for privacy conservation from the context of scal...

متن کامل

A Scalable Two-Phase Top-DownSpecialization Approach for Data Anonymization Using MapReduce on Cloud

A large number of cloud services require users to share private data like electronic health records for data analysis or mining, bringing privacy concerns. Anonymizing data sets via generalization to satisfy certain privacy requirements such as k-anonymity is a widely used category of privacy preserving techniques. At present, the scale of data in many cloud applications increases tremendously ...

متن کامل

Generalized Approach for Data Anonymization Using Map Reduce on Cloud

Data anonymization has been extensively studied and widely adopted method for privacy preserving in data publishing and sharing scenario. Data anonymization is hiding up of sensitive data for owner’s data record to avoid unidentified Risk. The privacy of an individual can be effectively preserved while some aggregate information is shared to data user for data analysis and data mining. The prop...

متن کامل

Data Anonymization of Vertically Partitioned Data Using Mapreduce on Cloud

In the world of computers, cloud services, on large scale, are being offered by service providers. User wishes to share some private information that has been stored on the cloud server due to various reasons such as data analysis, data mining and so on. These things bring up a concern about privacy. Privacy preservation may be attained by Anonymization data sets via normalization for satisfyin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015